Judging correlation from scatterplots and parallel coordinate plots

Authors

  • Jing Li
  • Jean-Bernard Martens
  • Jarke J. van Wijk
Abstract

Received: 12 December 2007; Revised: 11 March 2008; Accepted: 16 March 2008

Scatterplots and parallel coordinate plots (PCPs) can both be used to assess correlation visually. In this paper, we compare these two visualization methods in a controlled user experiment. More specifically, 25 participants were asked to report observed correlation as a function of the sample correlation under varying conditions of visualization method, sample size and observation time. A statistical model is proposed to describe the correlation judgment process. The accuracy and the bias in the judgments in the different conditions are established by interpreting the parameters in this model. A discriminability index is proposed to characterize the performance accuracy in each experimental condition. Moreover, a statistical test is applied to derive whether or not the human sensation scale differs from a theoretically optimal (i.e., unbiased) judgment scale. Based on these analyses, we conclude that users can reliably distinguish twice as many different correlation levels when using scatterplots as when using PCPs. We also find that there is a bias towards reporting negative correlations when using PCPs. Therefore, we conclude that scatterplots are more effective than parallel plots in supporting visual correlation analysis.

Information Visualization advance online publication, 1 May 2008; doi:10.1057/palgrave.ivs.9500179
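For readers unfamiliar with the quantity being judged, the sample correlation of a small data set can be sketched in a few lines of Python. This is only an illustration, not the authors' stimulus-generation procedure: the bivariate normal model, the function names `make_sample` and `pearson_r`, and the chosen sample sizes are all assumptions made here.

```python
import math
import random


def make_sample(rho, n, seed=0):
    """Draw n (x, y) pairs from a standard bivariate normal distribution
    whose population correlation is rho (assumed model, for illustration)."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n):
        x = rng.gauss(0.0, 1.0)
        e = rng.gauss(0.0, 1.0)
        # Mixing x with independent noise gives y a correlation of rho with x.
        y = rho * x + math.sqrt(1.0 - rho * rho) * e
        pairs.append((x, y))
    return pairs


def pearson_r(pairs):
    """Pearson sample correlation coefficient of a list of (x, y) pairs."""
    n = len(pairs)
    mean_x = sum(x for x, _ in pairs) / n
    mean_y = sum(y for _, y in pairs) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in pairs)
    sxx = sum((x - mean_x) ** 2 for x, _ in pairs)
    syy = sum((y - mean_y) ** 2 for _, y in pairs)
    return sxy / math.sqrt(sxx * syy)


if __name__ == "__main__":
    # Smaller samples scatter more widely around the population value.
    for n in (10, 40, 160):
        r = pearson_r(make_sample(0.5, n, seed=1))
        print(f"n={n:4d}  sample r={r:+.3f}")
```

The same list of pairs could be drawn either as points in a scatterplot or as line segments between two parallel axes in a PCP; the sample correlation is the same in both cases, and only its visual encoding differs.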

Similar resources

The parallel coordinate plot in action: design and use for geographic visualization

Implementations of interactive parallel coordinate plots in geographic visualization systems are presented. The plots represent spatial and spatio-temporal data, and are linked to maps and scatterplots. The interactive features of the parallel coordinate representations are discussed, with particular emphasis on their ability to facilitate geographic data exploration and understanding. The pape...

Enhancing scatterplot matrices for data with ordering or spatial attributes

The scatterplot matrix is one of the most common methods used to project multivariate data onto two dimensions for display. While each off-diagonal plot maps a pair of non-identical dimensions, there is no prescribed mapping for the diagonal plots. In this paper, histograms, 1D plots and 2D plots are drawn in the diagonal plots of the scatterplots matrix. In 1D plots, the data are assumed to ha...

ConnectedCharts: Explicit Visualization of Relationships between Data Graphics

Multidimensional multivariate data can be visualized using many different well-known charts, such as bar charts, stacked bar charts, grouped bar charts, scatterplots, or pivot tables, or also using more advanced highdimensional techniques such as scatterplot matrices (SPLOMs) or parallel coordinate plots (PCPs). These many techniques have different advantages, and users may wish to use several ...

Tracing Tuples Across Dimensions: A Comparison of Scatterplots and Parallel Coordinate Plots

One of the fundamental tasks for analytic activity is retrieving (i.e., reading) the value of a particular quantity in an information visualization. However, few previous studies have compared user performance in such value retrieval tasks for different visualizations. We present an experimental comparison of user performance (time and error distance) across four multivariate data visualization...

Journal title:
  • Information Visualization

Volume 9  Issue 

Pages  -

Publication date: 2010